Translation Inference across Dictionaries via a Combination of Graph-based Methods and Co-occurrence Statistics

نویسندگان

  • Thomas Proisl
  • Philipp Heinrich
  • Stefan Evert
  • Besim Kabashi
چکیده

This system description explains how to use several bilingual dictionaries and aligned corpora in order to create translation candidates for novel language pairs. It proposes (1) a graph-based approach which does not depend on cyclical translations and (2) a combination of this method with a collocation-based model using the multilingually aligned Europarl corpus.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Machine Learning for Translation Inference Across Dictionaries

This paper describes our contribution to the closed track of the Shared Task Translation Inference across Dictionaries (TIAD2017), held in conjunction with the first Conference on Language Data and Knowledge (LDK-2017). In our approach, we use supervised machine learning to predict high-quality candidate translation pairs. We train a Support Vector Machine using several features, mostly of the ...

متن کامل

CO-graph: A new graph-based technique for cross-lingual word sense disambiguation

In this paper, we present a new method based on co-occurrence graphs for performing Cross-Lingual Word Sense Disambiguation (CLWSD). The proposed approach comprises the automatic generation of bilingual dictionaries, and a new technique for the construction of a co-occurrence graph used to select the most suitable translations from the dictionary. Different algorithms that combine both the dict...

متن کامل

Query Translation Disambiguation as Graph Partitioning

Resolving ambiguity in the process of query translation is crucial to cross-language information retrieval when only a bilingual dictionary is available. In this paper we propose a novel approach for query translation disambiguation, named “spectral query translation model”. The proposed approach views the problem of query translation disambiguation as a graph partitioning problem. For a given ...

متن کامل

Using co-occurrence tendencies to improve Cross-Language Information Retrieval

Query disambiguation is considered as one of the most important methods in improving the effectiveness of information retrieval. In the present paper, we focus on query terms disambiguation via, a combined statistical method both before and after translation, in order to avoid source language ambiguity as well as incorrect selection of target translations. By combining query expansion with dict...

متن کامل

Context-Sensitive Inference Rule Discovery: A Graph-Based Method

Inference rule discovery aims to identify entailment relations between predicates, e.g., ‘X acquire Y à X purchase Y’ and ‘X is author of Y à X write Y’. Traditional methods discover inference rules by computing distributional similarities between predicates, with each predicate is represented as one or more feature vectors of its instantiations. These methods, however, have two main drawbacks....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017